# Multi-Task Generalization
Spatialvla 4b 224 Sft Bridge
MIT
This vision-language-action model is fine-tuned from SpatialVLA on the Bridge dataset and is designed specifically for the Simpler-env benchmark.
Text-to-Image
Transformers English

IPEC-COMMUNITY
1,066
0
Calme 3.2 Instruct 78b
Other
calme-3.2-instruct-78b is an advanced iteration built on Qwen2.5-72B. It is a general-domain large language model whose capabilities are enhanced through self-merging and fine-tuning.
Large Language Model
Transformers English

MaziyarPanahi
2,212
127